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Active Target Defense Differential Game with a Fast Defender 

Eloy Garcia, David W. Casbeer, and Meir Pachter 


Abstract —This paper addresses the active target defense 
differential game where an Attacker missile pursues a Target 
aircraft. A Defender missile is fired hy the Target’s wingman 
in order to intercept the Attacker before it reaches the aircraft. 
Thus, a team is formed hy the Target and the Defender which 
cooperate to maximize the distance between the Target aircraft 
and the point where the Attacker missile is intercepted by 
the Defender missile, while the Attacker tries to minimize 
said distance. The resnlts shown here extend previous work. 
We consider here the case where the Defender is faster than 
the Attacker. The solution to this differential game provides 
optimal heading angles for the Target and the Defender team to 
maximize the terminal separation between Target and Attacker 
and it also provides the optimal heading angle for the Attacker 
to minimize the said distance. 

I. Introduction 

Pursuit-evasion scenarios involving multiple agents rep¬ 
resent important and challenging types of problems in 
aerospace, control, and robotics. They are also useful in order 
to analyze biologically inspired behaviors. For instance, the 
paper [1] addressed a scenario where two evaders employ 
coordinated strategies to evade a single pursuer, but also to 
keep them close to each other. The authors of [2] discussed 
a multi-player pursuit-evasion game with line segment ob¬ 
stacles labeled as the Prey, Protector, and Predator Game. 
Dominance regions were provided for each agent in order 
to solve the game, that is, to determine if the Protector 
is able to rescue the Prey before the Predator captures it. 
A different approach to address pursuit-evasion games with 
several pursuers in order to capture an evader within a 
bounded domain is based on dynamic Voronoi diagrams, as 
in [3] and [4]. 

An scenario of active target defense including three agents, 
the Target (T), the Defender (D), and the Attacker (A), 
has been analyzed in the context of cooperative optimal 
control [5], [6]. Indeed, sensing capabilities of missiles and 
aircraft allow for implementation of complex pursuit and 
evasion strategies [7], [8], and recent work has proposed 
different guidance laws for the agents A and D. In [9] the 
authors addressed the case where the Defender implements 
Command to the Line of Sight (CLOS) guidance to pursue 
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the Attacker which requires the Defender to have at least 
the same speed as the Attacker. A different guidance law for 
the Target-Attacker-Defender (TAD) scenario was given by 
Yamasaki et.al. [10], [11]. These authors investigated an in¬ 
terception method called Triangle Guidance (TG), where the 
objective is to command the defending missile to be on the 
line-of-sight between the attacking missile and the aircraft 
for all time while the aircraft follows some predetermined 
trajectory. The authors show, through simulations, that TG 
provides better performance in terms of Defender control 
effort than a number of variants of Proportional Navigation 
(PN) guidance laws, that is, when the Defender uses PN 
to pursue the Attacker instead of TG. These approaches 
constrain and limit the level of cooperation between the 
Target and the Defender by implementing Defender guidance 
laws without regard to the Target’s trajectory. 

The papers [12], [13] presented an analysis of the end¬ 
game TAD scenario based on the Attacker/Target miss dis¬ 
tance for a non-cooperative Target/Defender. The authors 
develop linearization-based Attacker maneuvers in order to 
evade the Defender and continue pursuing the Target. 

Different types of cooperation have been recently proposed 
in [14], [15], [16], [17], [18], [19], [20] for the TAD scenario. 
In these papers the Target represents an aircraft trying to 
evade a missile homing on it. The Defender represents an¬ 
other missile launched by the aircraft (or a wingman) in order 
to intercept and destroy the Attacker in order to guarantee 
the survival of the aircraft. Thus, in [16] optimal policies 
(lateral acceleration for each agent including the Attacker) 
are provided for the case of an aggressive Defender, that 
is, the Defender has a definite maneuverability advantage. 
A linear quadratic optimal control problem is posed where 
the Defender’s control effort weight is driven to zero to 
increase its aggressiveness. Reference [17] provided a game 
theoretical analysis of the TAD problem using different 
guidance laws for both the Attacker and the Defender. The 
cooperative strategies in [18] allow for a maneuverability 
disadvantage for the Defender with respect to the Attacker 
and the results show that the optimal Target maneuver is 
either constant or arbitrary. In the recent paper [20] the 
authors analyze different types of cooperation assuming the 
Attacker is oblivious of the Defender and its guidance law is 
known. Two different one-way cooperation strategies were 
discussed; when the Defender acts independently, the Target 
knows its future behavior and cooperates with the Defender, 
and vice versa. Two-way cooperation where both Target and 
Defender communicate continuously to exchange their states 
and controls is also addressed, and it is shown to have a 
better performance than the other types of cooperation - as 


expected. 

Our preliminary work [21], [22] considered the cases 
when the Attacker implements typical guidance laws of Pure 
Pursuit (PP) and PN, respectively. In these papers, the Target- 
Defender team solves an optimal control problem that returns 
the optimal strategy for the T — D team so that D intercepts 
the Attacker and at the same time the separation between 
Target and Attacker at the instant of interception of A by D 
is maximized. 

In this paper the active target defense scenario is modeled 
as a zero-sum three-agent pursuit-evasion differential game. 
The two-agent team consists of a Target and a Defender 
who cooperate; the Attacker is the opposition. The goal 
of the Attacker is to capture the Target while the Target 
tries to evade the Attacker and avoid capture. The Target 
cooperates with the Defender which pursues and tries to 
intercept the Attacker before the latter captures the Target. 
Cooperation between the Target and the Defender is such 
that the Defender will capture the Attacker before the latter 
reaches the Target. In this differential game the Attacker also 
solves an optimal control problem in order to minimize the 
final separation between itself and the Target. Assuming that 
the Attacker knows the position of the Defender, this strategy 
provides better performance for the Attacker than using PP or 
PN. From the Attacker’s point of view, it is better to bring the 
Defender-Attacker interception point closer to the Target’s 
position (and hopefully produce some damage), even though 
the Attacker is then captured by the Defender. The present 
paper extends the results in [23] where it was assumed that 
both missiles, the Attacker and the Defender, have the same 
speed. Here, we extend the analysis of this differential game 
to include the operationally relevant case where the Attacker 
and the Defender missiles have different speeds; the focus 
of this paper is on the case where the Defender is faster 
than the Attacker. This scenario is more complex than the 
previously considered particular case of same speeds. Here, 
we derive the optimal strategies for each one of the three 
agents. In addition, given a Defender-Attacker speed ratio we 
provide the critical Target/Attacker speed ratio to guarantee 
its survival. 

We also obtain the analytical solutions of the differential 
game, and give special attention to the case where the Target 
starts closer to the Attacker than to the Defender. For this 
scenario we also provide the critical minimal speed of the 
Target for it to avoid capture; that is, when the Target starts 
closer to the Attacker than to the Defender, its speed must be 
bounded from below; otherwise the Target will be captured 
by the Attacker before the Defender can get in the way of 
the Attacker and intercept it. 

The paper is organized as follows. Section |II] states the 
active target defense differential game. Section HID provides 
a numerical method to solve the differential game. The 
minimum Target speed ratio to evade capture by the Attacker 
is given in Section HV] Analytical solutions of the differential 
game are provided in Section |V] Examples are given in 
Section |VT] and concluding remarks are made in Section [Vnl 



II. Differential Game 

The target defense differential game is illustrated in Fig. [T] 
The speeds of the Target, Attacker, and Defender are denoted 
by Vt, Va, and Vd, respectively, which are assumed to be 
constant. The agents have “simple motion” a la Isaacs. The 
dynamics of the three vehicles in the realistic game space 
are given by: 


XT =Vt cos (j), 

yr = Vt sin p 

(1) 

XA = Va cos X, 

ijA = Va sin x 

(2) 

xd = Vd cost/), 

i/D = Vd sin tp 

(3) 


where the headings of the Target, the Attacker, and the 
Defender are, respectively, cf) = ^-l-A, x = A-l-0 — %, and 
tp = tp + 9 + X — n. 

The variables R and r represent the separation between 
the Attacker and the Target and between the Attacker and the 
Defender, respectively. In this game the Attacker pursues the 
Target and tries to capture it. The Target and the Defender 
cooperate in order for the Defender to intercept the Attacker 
before the latter captures the Target. Thus, the Target- 
Defender team search for a cooperative optimal strategy 
to maximize R{tf) which represents the distance between 
the Target and the Attacker at the time instant // of the 
Defender capturing the Attacker. The Attacker will search 
for its corresponding optimal strategy in order to minimize 
R{tf). 

Define the speed ratio problem parameter a = VtIVa- 
In general, we have that the Attacker missile is faster than 
the Target aircraft, so a < 1. Let us define the speed ratio 
P = VdIVa- When the Defender is faster than the Attacker 
we have that P > \. 

III. Numerical Solution 

In this section, the corresponding dynamics of the three- 
agent engagement will be modeled using the reduced state 
space formed by the ranges R and r, and by the angle 
between them, denoted by 0 see Fig. [T] The objective 
of the Target-Defender team is to determine their optimal 
heading angles p and p in this reduced state space such 
that the distance R{tf) is maximized at the time instant 




tf where the separation r{tf) = r^, where Tc denotes the 
Defender’s capture radius. The interception time f/ is free. 
The objective of the Attacker is to determine its optimal 
heading angle, denoted by such that the distance R{tf) 
is minimized. Note that the relative heading angles can be 
easily transformed to heading angles with respect to the 
fixed coordinate axis x using the line of sight angle from 
the Attacker to the Target, denoted by A. The use of the 
reduced state space provides a compact representation of the 
dynamics of this three-agent differential game. 

The (normalized with respect to the speed Va) dynamics 
in the reduced state space are 
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for 0 < t < tf. 

The objective of the Target-Defender team is to maximize 
the separation between the Target and the Attacker at the 
interception time R{tf), where the terminal time tf is free, 
such that r{tf) = Xc- The objective of the Attacker is to 
minimize the same distance R{tf). This can be expressed as 

■ 

max min J= / Rdt. (7) 

X Jtg 

Then, the Hamiltonian is given by (where the Target- 
Defender team aims at minimizing —J and the Attacker 
aims at maximizing — J, for convenience of notation of the 
solutions): 

H = cos(0 — x) — ot cos cj) 

+ {a cos cj) — cos(0 — x))A/i 
— (cos X + /3 cos tp) Xr 

+ {-^sm(j)+j^sm{e-x)-^smtp + ^smx)Xe 

( 8 ) 


and the co-state dynamics are given by: 

- X) - a sin </>) (9) 

AV = ^ (sinx -/3sin'0) (10) 

Xe = (1 - AH)sin(6» - x) - ^cos(6<-x)- (H) 

The terminal conditions for this free terminal time problem 
are as follows. The terminal state r{tf) is fixed and equal 
to Tc- Because the terminal states R{tf), and 9{tf) are 
free, we have XR{tf) = Xg{tf) = 0. The final termi¬ 
nal condition for optimality for this problem requires that 
H{x*{tf),u*{tf),X*{tf),tf) = 0. In summary, the terminal 
conditions are: 

r{tf) =rc 
^9{tf)= 0 

a'^ + 2(^api+cos9{tf))Xr{tf) + — l)Xl{tf) — 1 = 0 . 

( 12 ) 


Proposition 1: The Target and Defender optimal control 
headings that maximize the separation between the Target 
and the Attacker and achieve r(f/) = Tc are given by 

. I* ^9 

smip = 




A2/r2 


COS'Ip* = 


Ay 


sin 0 = 


coscp* = 


yAf+AfT^ 

Xe 


1 — Aij 


(13) 


(14) 


(15) 


(16) 


v'(l-Afl)2 + A^/i?2- 

The Attacker optimal control heading that minimizes the 
separation between itself and the Target at t = tf is given 
by 


sinx = 


cosx = 




xs 


Xc 


Vxl + xl 

^ cos 6» -1 


(17) 

(18) 

and Xc = (1 - 


where Xs = (1 — Xr) sin0 — 

Xr) cos 9 -I- ^ sin0 — A^. 

Proof. In order to find the optimal heading angle equations 
involving f}* we solve for this variable by differentiating the 
Hamiltonian (0 in ■0 and setting the derivative to 0 

^TT O 

-—= pXrSiwip ——Xecosip = Q. (19) 

dip r 

Using the trigonometric identity cos^ ip = 1 — sin^ ip we can 
write (O as 


sin"^ Ip = 


'^9 


d^H 


r2(A2 -b A^/r2) 
and (fTST l follows. The expression (fT4li is found in a similar 
way by letting sin^ ip = 1 — cos^ ip in (Hsii. We can compute 
the second partial derivative of the Hamiltonian with respect 
to Ip to show that this solution minimizes the cost — J. Doing 
so we obtain 

= piXr cos'i/' + ^Xg sinip 
- P>'l + ^ n (20) 

yAf+A|77^ 

which means that ip* minimizes the cost — J; equivalently, 

it maximizes the final separation R{tf). 

The optimal heading of the Target can be found in a similar 

way. Let us evaluate 

dH Ot 

= ail-XR)siop-—Xgcos(p = {). (21) 

op R 

We use the trigonometric identity cos^ p = 1 — sin^ p to 
write (EB as 


sin^ p* = 


X2 


i?2((l-Afi)2+A2/i72) 
and we obtain (fTSl) . The expression (fTbl) is found in a similar 
way by setting sin^ p = I — cos^ p in (l2TT i. Similarly, we 
compute 

= a(l - Afi) cos(()-b-gAe sini^ 

a(l —Xh)^ , aXfl 


\/(i-XR)2-rx2/i?2 flV(i-XR)^-rA2/fl2 


> 0 


( 22 ) 
























which means that (p* minimizes the cost — J; equivalently, 
it maximizes the final separation R{tf) 

The optimal heading x* characterized in a similar 
way. We differentiate the Hamiltonian ([Sll in x and set the 
derivative to 0 

H = (1 - Afl) sin(6» - x) + Ar sin x 
- ^ cos(6» - x) + ^ cosx = 0. 

Using the trigonometric identities: 


sin(0 — x) = sin^cosx — cos0sinx (24) 

cos(0 — x) = COS0COSX + sin^sinx (25) 


we can write dSS as follows: 

((1 — Xr) sin0 — ^ cos 9 - 


\s-\ 


-) cosx 

= ((1 — Xr) cos 9 + ^sm9 — Xr) sin x- 

We now use the trigonometric identity cos^ x = 1 
to obtain 


(26) 


sin^ X 


sin X = 

Xt+Xc 

and ( fTTl i follows. The expression ( fTSl l is found in a similar 
way by setting sin^ x = 1 ~ cos^ x in (l26l l. 

In order to guarantee that the Attacker optimal control 
maximizes the objective — J we evaluate the second partial 
derivative of the Hamiltonian with respect to the Attacker 
control input. 


dpH 

dx‘‘ 


= -(1 - Xr)cos{9 - x) + Ar cosx 
- ^sm{9 - x) - v®™X- 


(27) 


Inserting the expressions (l24l i and 
obtain the following 


apH 

dx-‘ 


into eq. we 

-4^)sinx 


= — ((1 — Xr) sin0 — ^ cosi 
— ((1 — Afl) cos 9 + ^ sin 0 — A^) cos x < 0. 

(28) 


Therefore, the solutions ( fTTI i and (fTSl l maximize the objective 
— J, which is equivalent to minimize the terminal separation 
R{tf). □ 

The expressions for the optimal heading angles (fTAt - ifTSl l 
are used to numerically solve the Two-Point Boundary Value 
Problem (TPBVP) (|4|-(|7]i, (l9Tl- (fT8l l. The numerical solution 
is found by substituting the optimal control headings into the 
state equations (IDi-®, and the co-state equations (l9Tl- (fTTl i. 
with the terminal conditions given by (fT2T i. 


IV. Critical Speed Ratio for Target Survival 

In this section we consider point capture, that is, the 
separation r has to satisfy r{tf) 0 in order for the 
Defender to capture the Attacker. The scenario is illustrated 
in Fig. 12] We consider the rotating reference frame anchored 
on the Attacker and the Defender. In Fig. |2]the points A and 
D represent the positions of the Attacker and the Defender, 
respectively. A Cartesian frame is attached to the points A 
and D in such a way that the extension to infinity of AD 
in both directions represents the X-axis and the orthogonal 
bisector of AD represents the V-axis. The positions of the 
three agents in this frame are T = [xt^Vt), A = (xa, 0), 
and D = {—xa, 0). 



With respect to Fig. |2| we note that the Attacker aims 
at minimizing the distance between the Target at the time 
instant when the Defender intercepts the Attacker, point T', 
and point I, where the Defender intercepts the Attacker. The 
points T and T' represent the initial and terminal positions 
of the Target, respectively. 

Define 7 = 1//? = VaIVr. When 7 < 1 the Defender 
will intercept the Attacker at some point I = {xi,yi) that 
lies on the Apollonius circle defined by the Defender and 
the Attacker separation and the speed ratio 7. The center of 
the DA-based Apollonius circle is at (a, 0), where 

1 + 7 ^ 

a = -r^XA (29) 

1 _ 7^ 

and the radius of the DA Apollonius circle is 

27 

ta = (30) 

1 _ 7 ^ 

When the Target is inside the DA Apollonius circle, its 
speed needs to be high enough in order to exit from the 
DA Apollonius circle before being captured by the Attacker. 
If the Target is able to exit the DA Apollonius circle then 
the Defender will be able to assist the Target to escape, by 
intercepting the Attacker who is on route to the Target. 

Proposition 2: Given the speed ratio 7 = Va/Vd < 1, 

the critical speed ratio a is a function of the positions of the 

Target and the Attacker and is given by 

- _ 7 \/ {XA + xt)"^ AVt- \/{x a- xt)'^ + 

(X . t.51J 

2-fXA 

Proof. In order to determine the minimum speed ratio, 
a, that guarantees Target survival we consider a second 
Apollonius circle defined by the Attacker and the Target 
using the Target/Attacker speed ratio a. Thus, a solution to 
the differential game exists if and only if the AT Apollonius 
circle, which is based on the segment AT and the speed ratio 
a, intersects the DA Apollonius circle, the one based on the 
the segment DA and the speed ratio 7. The lower limit a 
on the speed ratio a, that is, a < a < 1 , corresponds to 
the case where the AT Apollonius circle is tangent to the 
DA Apollonius circle, see Fig. |3] Note that if the speed 












Fig. 3. Determination of a 


ratio a > 1 the Target always escapes and there is no need 
for a Defender missile, that is, there is no target defense 
differential game in the first place. 

The Attacker’s initial position, the Target’s initial position, 
and the center O of the AT Apollonius circle are collinear 
and lie on the dotted line in Fig. [3 which can be represented 
as 


y = - 


Vt 

Xa - XT 


X + 


xaVt 

xa-xt' 


The geometry of the second Apollonius circle is as follows: 

The center of the circle, denoted by O, is at a distance of 
2 

from T and its radius is rn = t i d, where d is the 
distance between A and T and is given by 


d = {xA - xtY + Ut- (32) 

Hence, the following holds 

( xtVt Vt , .2 

- Xo] +{Xo-Xt) 

\XA — XT XA — XT ' 

and we calculate the coordinates of the center of the second 
Apollonius circle 

Xo = i _ q ,2 XT — XA ^ 23 ) 

yo = T^yr- 

From Fig. [3 we can see that the three points a, O, and I are 
collinear, where / represents the tangent point where both 
circles meet. Thus, we have the following relationship 


TO =rA - q (34) 

where q = \/{a — xq)^ + J/q. Eq. (l34l i can be written as 
follows 

(a-xof + y!> = (rA--^df. (35) 

Eq. (iTSl l can be expressed in terms of the known positions 
(xt, yT, and xa), the known speed ratio 7 , and the variable 


we aim to solve for, which is a. After a few steps we obtain 
the following quartic equation in a 

-^x\a^ + -^Xao? 

- + -^XAXT - (36) 


47d ™ I 
~ 1 xaoi -r 


47' 


tXaXt 


-d‘^ = 0 


which can be factored out to obtain the quadratic polynomials 

(^A^XACtijXACe + d) + (1 — 7^)d^ — Aj'^xaXt) — l) 

= 0 . 

The solutions a = ±1 are irrelevant to the differential game 
under analysis. Thus, the critical speed ratio a is given by 
the positive solution of the quadratic equation 

47^a:^a^ + 47a:,4da + (1 — 7^)d^ — i^'^XAXT = 0 

which is given by OTT i. □ 

In the particular case where yT = 0, the critical speed 
ratio is given by 

xa{i - 1) + xt{i + 1) 


a = 


2'yxA 


(37) 


Further, the initial Target position (xt,^) for which the 
critical a is equal to zero can be obtained from (iTTl i 

xa{i - 1) + xt(i + 1) 


0 = 


which is equivalent to 

1 + 7 ^ 


2'yxA 

(1 - 'y)xA 


XT = 


1 


■7 


(38) 


a-rA = 


1 — 7' 


:XA - 
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1 — 7 ' 


;XA = 


{ l-"t)XA 
1 + 7 


as expected. 


V. Optimal Strategies 

When 7 < 1 the Defender will intercept the Attacker at 
some point I = {xi,yi) that lies on the DA Apollonius 
circle. Notice that all points outside the DA Apollonius circle 
can be reached by the Defender before the Attacker does; 
similarly, all points inside the same circle can be reached by 
the Attacker before the Defender does. 


A. Target Starts Outside of DA Apollonius Circle 

In the case where the Target is initially outside the DA 
Apollonius circle it can be clearly seen that the Defender 
can help the Target regardless of the speed ratio 0 < a < 
1 because the Attacker cannot reach the Target before the 
Defender does. In other words, the critical speed ratio in 
this case is a = 0. In the case where the Target is outside 
the DA Apollonius circle, the Target chooses point v on the 
DA Apollonius circle (in order to run away from that point) 
and the Attacker chooses his aimpoint u on the same circle. 
Additionally, the Defender tries to intercept the Attacker by 
choosing his aimpoint w, also on the DA Apollonius circle. 
The Target, the Defender, and the Attacker are faced with 
the minmax optimization problem: min„ max„ J{u, v, w), 
where J{u, v,w) = S and S represents the distance between 
the Target terminal position T' and the point on the DA 
Apollonius circle where the Attacker is intercepted by the 
Defender. 

































The Defender helps the Target to escape by intercepting 
the Attacker at the point u on the DA Apollonius circle. 
Therefore, the Defender’s optimal policy is w*{u,v) = u 
in order to guarantee interception of the Attacker. Since 
the Defender’s optimal policy is w* = u we have that the 
decision variables u and v jointly determine J{u,v), where 
J{u,v) = S and S represents the distance between the 
Target terminal position T' and the point 7 = it on the DA 
Apollonius circle where the Attacker is intercepted by the 
Defender. 

Proposition 3: Given the cost/payoff function J{u,v), 
the solution it* and v* of the optimization problem 
min„ max„ J(it, v) is such that 

* * 
u = V . 

Moreover, when the Target is outside the DA Apollonius 
circle, the Target’s strategy is v*{u) = argmax^ J(it, v) = u 
so that it suffices to solve the optimization problem 

min^j^y, J{xi,yi) 

subject to (a — + yj = r\ 

where 

Jjxi.yi) = s/{xi -xtY+ {yi- yrf ™ 
+ a^{xA - xiY + y'j. 

□ 

The Attacker chooses the optimal coordinates {xj,yi) of 
point I that minimize the final separation J{xi, yi) = IT + 
TT' see Fig. H) 

One way to formulate this problem is as shown in Propo¬ 
sition [ 3 ] The equality constraint can be used in the cost to 
write J{xi), that is, to write the cost in terms of only one 
variable. For instance, the first derivative _ q j-gsups 

OXI 

in a sixth order equation in xj. 

Theorem 1: The optimal interception point I that mini¬ 
mizes (l40l i has polar coordinates I = [if* , r. 4 ) with respect to 
the center of the DA Apollonius circle denoted by a, where 


tp* is the solution of the sixth order complex exponential 
equation 


Nr A ! 1 _ _N_\ 6itp 
I a^MlJ^ 

+2{r\ + 7V2 _ 

+NrA{J^i2-P)-2l + \)e^^^ 

+ {i3^)\r\+M^)-r\-N^)D‘P 
+NrAlil-Jj^)=0 

that minimizes the cost 


J{p) = y^r5'^K/V^'^^'2]VrACOs(^^^^ 
-I- a\/r\ + M'^ — 2MrA cos p 


(41) 


(42) 


where I = e , M = jzNpXA represents the distance 
between the points A and a, and N = s/{a — xt)^ + Vt 
represents the distance between the points a and T. 

Proof. An alternative and more compact algebraic equation 
to directly solving (l40l i can be obtained by searching for the 
optimal angle p that minimizes the same cost see Fig. |4l 

It can be seen that, by varying the angle p, the point I 
moves along the circumference of the DA Apollonius circle. 
In order to write an equivalent expression to (l40l) . but only in 
terms of p, we consider the two triangles AaAI and AaTI. 

The distance TT' is proportional to the distance AI. 
The distance AI changes as the angle p takes different 
values. However, the distance al = ta and the distance 
aA = ^2 ‘i are fixed. Similarly, the distance IT changes 
in terms of the angle p, but the distance al, the distance aT, 
and the angle A are fixed. Then, the cost (l40l) can be written 
as in (l42l i 

The first derivative of (l42l i is 

dj((^) _ N s\r\.{tp—X) 


y^r'^-\-N^ — 2NrA cos{(f>—X) 

I _ cxM sin ip _ 

-y/r^+M^—2MrA cos ip 

Setting (l43l l equal to zero we obtain 


(43) 


sin^((^ —A) _ 

r^+A^^—2A^r^ cos(c/? —A) —2MrA cos ip ' 


(44) 


In order to solve for the angle p we use the complex 
exponential to obtain 

IT A) + Mr A -f e”®^)) = 

^^^(e®‘^-e-®^)^(r^-fAf2-ArrA(e®(‘^-^) + e-Av-D)y 

(45) 


After some manipulation we obtain a sixth order polynomial 
equation in e®”^ as it is shown in (|4TI) . 

The polynomial in (l4TI) has complex coefficients. The six 
solutions of (HTt are complex, in general, of the form e®"^ = 
cos p + i sin p. Thus, the angle p can be directly obtained. 
In the worst case, we only need to test the six angles in the 
cost function to determine the optimal solution p*. □ 


B. Target Starts Inside of DA Apollonius Circle 

In the case where the Target is inside the DA Apollo¬ 
nius circle, the Target chooses his aimpoint v on the DA 
Apollonius circle and the Attacker chooses his aimpoint 
u on the same circle. Additionally, the Defender tries to 
























intercept the Attacker by choosing his aimpoint w, also 
on the DA Apollonius circle. The Target, the Defender, 
and the Attacker are faced with the maxmin optimization 
problem: max^^^j minu J(u, w), where J{u,v,w) = S 
and S represents the distance between the Target terminal 
position T' and the point on the DA Apollonius circle where 
the Attacker is intercepted by the Defender. 

The Defender’s optimal policy is w*{u,v) = m in order 
to guarantee interception of the Attacker and the decision 
variables u and v jointly determine J{u,v). Now, let us 
analyze the possible strategies. If the Target chooses v, the 
Attacker will respond and choose u. If u ^ v the Target 
would correct his decision and choose some v such that 
5 > 5” as shown in Fig. m In general, choosing u ^ v 
is detrimental to the Attacker since the resulting cost will 
increase. Thus, it is clear that the Attacker should aim at the 
point V which is chosen by the Target. 

Proposition 4: Given the cost/payoff function J{u,v), 
the solution u* and v* of the optimization problem 
max„ minu J(u,,w) is such that 

* * 

U = V . 

Moreover, when the Target is inside the DA Apollonius 
circle, the Attacker’s strategy is u*{v) = argmin„ J{u,v) = 
V so that it suffices to solve the optimization problem 
J{xi,yi) 

subject to {a — xi)^ + yj = 

where 

Jjxi^yi) = a^y jxA- xi)'^ Ayj _ 

□ 


The difference between the Target being inside or outside 
the DA Apollonius circle is not only the sign in the cost 
function but the Target and Attacker strategies. In the case 
treated in this subsection, the Target chooses the coordinates 
{xj, yi) that maximize the final separation J{xi,yj) and the 
Attacker follows the Target’s decision. Additionally, when 
the Target is inside the DA Apollonius circle its critical 
speed is Of > 0 (given by (OTl il: when the Target is outside 
the same circle then its critical speed is a = 0. 

Theorem 2: The optimal interception point I that maxi¬ 
mizes Wt\ has polar coordinates I = {p* ^ta) with respect to 
the center of the DA Apollonius circle denoted by a, where 
if* is the solution of the sixth order complex exponential 
equation (HTI) that maximizes the cost 

J{ip) = ay/ r\ + M‘^ - 2MrA cos p 

— \/r\+ — 2NrA cos((/3 — A) 


where / = M represents the distance between the points 
A and a, and N represents the distance between the points 
a and T. 

Proof. The cost (l47l) can be written in terms of the angle 
tp as in (|48]) . The first derivative of (l48l) is 

dj (y?) _ _ oiM sin c/; _ 

^r\+M'^-2MrA cos 93 , 

N sin{(p — X) 

■y/r^+A^^—27VrA cos(c^ —A) 


Y 



Fig. 5. maxmin optimization problem 


Setting (|49] | equal to zero we obtain (l44l) and, consequently, 
the optimal angle p* is the solution of (HTt that maximizes 
gSll. □ 


VI. Examples 


Example 1. Target is outside the DA Apollonius circle. The 
speed ratios are a = 0.25 and 7 = 0.8. The initial conditions 
of the three agents are given by: A = (4,0), D = (—4,0), 
and T = (0.5,4). We calculate a = 18.22 and ta = 17.78. 
The six solutions of (|4TI) are given by 

ipi = -2.9596 
ip2 = -2.8573 
7)3 = 0.0001 
7)4 = 0.0001 
715 = 0.2254 

PQ = 0.2186. 

By evaluating these solutions using (l42l) we have that the 
optimal solution is ip* = 0.2186, which yields I* = 
(0.8676,3.8555). The trajectories are shown in Fig. [b] Note 
that the same trajectories and optimal interception point are 
obtained by using the numerical method from Sec. HIJ 

Example 2. Target is inside the DA Apollonius circle. The 
speed ratios are a = 0.5 and 7 = 0.93. The initial conditions 
of the three agents are given by: A = (6,0), D = (—6,0), 
and T = (3.1,2.7). In this case we calculate a = 82.823 
and TA = 82.605. Since the Target is initially inside the 
DA Apollonius circle, the critical speed ratio a is greater 
than zero. We can use eq. (EB to find the exact value of 
the critical speed ratio which is a = 0.436. Thus, the value 
a = 0.5 > a guarantees the Target’s escape. Now we can 
search for the optimal angle p* that solves the differential 
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Fig. 6. Optimal trajectories in Example 1 
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Fig. 7. Optimal trajectories in Example 2 


game. The six solutions of eq. (EB are given by 
= -3.0752 
<^2 = -3.1189 
<^3 = -0.0014 
<^4 = -0.0014 
= 0.0277 
ipQ = 0.0429. 

By evaluating these solutions using eq. (l48l) we have that 
the optimal solution is ip* = 0.0429, which yields /* = 
(0.293, 3.539). The trajectories are shown in Fig. |2l Note 
that the same trajectories and optimal interception point are 
obtained by using the numerical method from Sec. HIJ 
Example 3. Robustness to unknown Attacker guidance 
law. A very important characteristic of the cooperative 
guidance laws for the active target defense differential game 
as discussed in this paper is that the solution given by 
the sixth order equation (l4TT i is a closed-loop interception 
strategy that is robust to unknown Attacker guidance laws. 
This means that if the Attacker does not follow its optimal 



Fig. 8. Trajectories in Example 3 


policy and uses a different guidance law that is unknown to 
the Target-Defender team then the Target and the Defender 
(having current measurements of the Attacker’ position) are 
able to solve (l4Tl i and continuously update their cooperative 
interception strategy, thus increasing the T — A separation at 
interception time. 

Let the initial positions of the three agents be: A = (10,0), 
D = (—10,0), and T = (3, 7.5). The speed ratios are a = 
0.6 and 7 = 0.85. The Attacker implements PN guidance 
law with navigation constant N=3. However, this information 
is unknown to the Target-Defender team and they are only 
able to measure the current position of the Attacker, A = 
{xA{t), yA{t))- By continuously updating their headings, the 
Target-Defender team are able to defeat the Attacker, that is, 
the Defender intercepts the Attacker and the Target escapes 
being captured by the Attacker. The trajectories for this 
example are shown in Fig. | 8 ] The final separation between 
Target and Attacker is Ritf) = 5.609 > J*. As expected, 
the final separation is more than if the Attacker played 
optimally. When the Attacker plays optimally the cost/payoff 
is J* = 5.373. 

VII. Conclusions 

A numerical and an analytical solution to the active 
target defense differential game with a fast Defender were 
presented in this paper. The numerical solution is based 
on the Pontryagin’s Maximum Principle applied to the dif¬ 
ferential game and a TPBVP is solved numerically. The 
analytical approach hinges on the solutions of a sixth-order 
polynomial equation that provides the optimal interception 
point’s coordinates, hence it provides the optimal headings 
for the players. This result comes with an expected increase 
in complexity compared to the case where both missiles are 
restricted to have the same speed [23]. In that case, the 
solution of the differential game required the rooting of a 
fourth-order polynomial. 
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